PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Sopim01g100510.0.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family TALE
Protein Properties Length: 336 aa    MW: 37912.3 Da    pI: 5.1722
Description TALE family protein
Gene Model
Gene Model ID       Type    Source
Sopim01g100510.0.1  genome  CSHL
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  26.5   1.1e-08  270    302  23         55
                         SS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHH CS
            Homeobox  23 rypsaeereeLAkklgLterqVkvWFqNrRake 55 
                         +yp++ee+ +L++ +gL+++q+ +WF N+R ++
  Sopim01g100510.0.1 270 PYPTEEEKNRLSEMTGLDQKQINNWFINQRKRH 302
                         8*****************************885 PP

2    ELK       38.7   2.1e-13  222    243  1          22
                 ELK   1 ELKhqLlrKYsgyLgsLkqEFs 22 
                         ELK++L+rKYsgyL+sL++EF+
  Sopim01g100510.0.1 222 ELKEMLMRKYSGYLSSLRKEFL 243
                         9********************8 PP

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SMART            SM01255           3.4E-17   74     118  IPR005540    KNOX1
Pfam             PF03790           1.2E-19   76     116  IPR005540    KNOX1
SMART            SM01256           6.2E-27   124    175  IPR005541    KNOX2
Pfam             PF03791           7.1E-25   127    174  IPR005541    KNOX2
SMART            SM01188           1500      133    153  IPR005539    ELK domain
PROSITE profile  PS51213           11.383    222    242  IPR005539    ELK domain
Pfam             PF03789           2.2E-10   222    243  IPR005539    ELK domain
SMART            SM01188           3.1E-7    222    243  IPR005539    ELK domain
PROSITE profile  PS50071           12.795    242    305  IPR001356    Homeobox domain
SMART            SM00389           1.5E-12   244    309  IPR001356    Homeobox domain
SuperFamily      SSF46689          1.11E-19  244    314  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  1.5E-26   247    307  IPR009057    Homeodomain-like
CDD              cd00086           1.79E-12  254    306  No hit       No description
Pfam             PF05920           3.2E-17   262    301  IPR008422    Homeobox KN domain
PROSITE pattern  PS00027           0         280    303  IPR017970    Homeobox, conserved site
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 336 aa
MMDELSKLHS AIVCSHSRRQ QEVEVEAEAG PTIINNTTTS FAAVHHHYCQ LEAAVAADHN  60
HHQNNTKSTT NMSDLIKAQI ANHPLYPNLL SAYLQCRKVG APQEMTSILD EISKENNLIS  120
SSRHSSEIGA DPELDEFMES YCAVLVKYKE EFSKPFDEAT SFLSNIESQL SSLCKDNLIT  180
STSFNNYISD EAGGSSDEDL GCEEMEAADS QESPANCEGD NELKEMLMRK YSGYLSSLRK  240
EFLKKRKKGK LPKEARIVLL DWWKNHYRWP YPTEEEKNRL SEMTGLDQKQ INNWFINQRK  300
RHWRPSEDMK FALMEGVSAG SMYFDGSGGT GNIGT*
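
The domain coordinates listed under Signature Domain can be checked against the sequence above. The following is a minimal sketch, not part of the database record: the helper name `region` is hypothetical, and it assumes the Start/End columns are 1-based and inclusive, which is consistent with the alignment rows shown for both domains. The residue string holds 335 letters once the trailing `*` is dropped; the listed length of 336 appears to count that symbol.

```python
# Protein sequence copied from the record, with the spaces and position
# counters removed and the trailing "*" dropped.
SEQ = (
    "MMDELSKLHSAIVCSHSRRQQEVEVEAEAGPTIINNTTTSFAAVHHHYCQLEAAVAADHN"
    "HHQNNTKSTTNMSDLIKAQIANHPLYPNLLSAYLQCRKVGAPQEMTSILDEISKENNLIS"
    "SSRHSSEIGADPELDEFMESYCAVLVKYKEEFSKPFDEATSFLSNIESQLSSLCKDNLIT"
    "STSFNNYISDEAGGSSDEDLGCEEMEAADSQESPANCEGDNELKEMLMRKYSGYLSSLRK"
    "EFLKKRKKGKLPKEARIVLLDWWKNHYRWPYPTEEEKNRLSEMTGLDQKQINNWFINQRK"
    "RHWRPSEDMKFALMEGVSAGSMYFDGSGGTGNIGT"
)


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end (hypothetical helper; assumes 1-based,
    inclusive coordinates as in the Signature Domain table)."""
    return seq[start - 1:end]


# Coordinates taken from the Signature Domain table.
elk = region(SEQ, 222, 243)       # ELK domain match
homeobox = region(SEQ, 270, 302)  # Homeobox domain match

print("ELK      222-243:", elk)
print("Homeobox 270-302:", homeobox)
```

Both extracted strings should equal the query rows of the corresponding alignments (the lines labelled Sopim01g100510.0.1).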
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    237    246  LRKEFLKKRK
2    243    247  KKRKK
Binding Motif
Motif ID  Method  Source
MP00670   PBM     Transfer from GRMZM2G087741
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AF375968  0.0      AF375968.1 Lycopersicon esculentum knotted homeodomain protein 4 (KN4) mRNA, partial cds.
GenBank  AF533597  0.0      AF533597.1 Lycopersicon esculentum knotted protein TKN4 (kn4) mRNA, complete cds.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     NP_001266275.1      0.0      knotted homeodomain protein 4
Swissprot  Q9FP29              1e-101   KNOS1_ORYSJ; Homeobox protein knotted-1-like 1
TrEMBL     Q7Y0Z5              0.0      Q7Y0Z5_SOLLC; Knotted homeodomain protein 4 (Fragment)
STRING     Solyc01g100510.2.1  0.0      (Solanum lycopersicum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA7532480
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G23380.1  1e-77    KNOTTED1-like homeobox gene 6